Embedding Knowledge in Web Documents

نویسندگان

  • Philippe Martin
  • Peter W. Eklund
چکیده

The paper argues for the use of general and intuitive knowledge representation languages (and simpler notational variants, e.g. subsets of natural languages) for indexing the content of Web documents and representing knowledge within them. We believe that these languages have advantages over metadata languages based on the Extensible Mark-up Language (XML). Indeed, the retrieval of precise information is better supported by languages designed to represent semantic content and support logical inference, and the readability of such a language eases its exploitation, presentation and direct insertion within a document (thus also avoiding information duplication). We advocate the use of Conceptual Graphs and simpler notational variants that enhance knowledge readability. To further ease the representation process, we propose techniques allowing users to leave some knowledge terms undeclared. We also show how lexical, structural and knowledge-based techniques may be combined to retrieve or generate knowledge or Web documents. To support and guide the knowledge modeling approach, we present a top-level ontology of 400 concept and relation types. We have implemented these features in a Web-accessible tool named WebKB (http://meganesia.int.gu.edu.au/ ̃phmartin/WebKB/), and show examples to illustrate them.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Document Embedding Method for News Classification

Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...

متن کامل

Embedding Knowledge in Web Documents: CGs versus XML-based Metadata Languages

The paper argues for the use of general and intuitive knowledge representation languages for indexing the content of Web documents and representing knowledge within them. We believe these languages have advantages over metadata languages based on the Extensible Markup Language (XML). Indeed, the representation and retrieval of precise information is better supported by languages designed to rep...

متن کامل

Analyzing the Collaboration Network of Global Scientific Outputs in the Field of Bibliotherapy in the Web of Science Database

Background and Aim: Bibliotherapy is a useful treatment for the prevention and treatment of mental disorders and has led to the formation of many scientific publications in this field. The purpose of this study was to investigate the publication trends in the field of bibliotherapy and visualize the structure of its scientific collaborations based on the Web of Science database during the perio...

متن کامل

بررسی تولیدات علمی در زمینه حقوق بیماران در عرصه بین‌المللی نمایه شده در پایگاه Web of Science بین سالهای 2000 تا 2014

Introduction: One of the criteria showing the importance of a research area is the scientific products in that research area. The aim of the current study was to investigate the situation of scientific products on the topic of Patients’ rights indexed in ISI-Web of Science between the years 2000 until 2014. Methods: The method used was descriptive-cross sectional with a Scientometrics...

متن کامل

Embedding , reorganization and construction of mathematical and didactical contents as an objective in teachers education

MaDiN (Ma)thematics and (Di)dactics (N)etwork is a fast growing knowledge base containing material about teaching mathematics (URL: http://visum2.uni-muenster.de ). Four universities in Germany participate in this project developing contents in different branches of primary, lower and upper secondary student teachers education in mathematics. The project is funded by the German ministry of scie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Networks

دوره 31  شماره 

صفحات  -

تاریخ انتشار 1999